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■ Abstract. In this paper we study the problem of testing the null hypothesis that 

errors from k independent parametrically specified generalized autoregressive con- 
O I ditional heteroskedasticity (GARCH) models have the same distribution versus a 

general alternative. First we establish the asymptotic validity of a class of linear 



test statistics derived from the k residual-based empirical distribution functions. A 
distinctive feature is that the asymptotic distribution of the test statistics involves 
terms depending on the distributions of errors and the parameters of the models, 
■ and weight functions providing the flexibility to choose scores for investigating power 

performance. A Monte Carlo study assesses the asymptotic performance in terms 
of empirical size and power of the three-sample test based on the Wilcoxon and Van 
J2 ' der Waerden score generating functions in finite samples. The results demonstrate 

that the two proposed tests have overall reasonable size and their power is partic- 
ularly high when the assumption of Gaussian errors is violated. As an illustrative 
example, the tests are applied to daily individual stock returns of the New York 
Stock Exchange data. 
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> : 1 Introduction 

. Analysis of volatility in financial time series is certainly the subject of considerable atten- 
', tion with huge literature having been published. In the seminal papers by Engle (1982) 
and BoUerslev (1986), generalized autoregressive conditional heteroskedasticity (GARCH) 
models have been proposed to capture special features of financial volatilities. Since then, 
numerous variations and extensions of GARCH models have been proposed to possi- 
bly explain and model risk and uncertainty in pricing derivative securities, in stochastic 
modelling of the term structure of interest rates, in applications related to fixed-income 
portfolio management, in asset pricing studies, and in the riskiness of financial returns 
which provides a volatility measure that can be used in financial decisions concerning risk 
analysis. Several excellent surveys of the GARCH methodology in finance are available, 
such as Bollerslev et al. (1992), Engle (1995), Gourieroux (1997), Mikosch (2003) and 
Bauwens et al. (2006). 

For time series data, residuals must be taken into account as they typically depend on 
parameter estimates, and inference based on these residuals, especially various diagnostic 
checks, is a basic tool in the statistical analysis of linear time series models (see Brock- 
well and Davis (1994)). By contrast, asymptotic theory for the residuals of nonlinear 
time series models has been surveyed by Berkes and Horvath (2002). For a GARCH(p, q) 
model, Berkes and Horvath (2003) derived the asymptotic distribution of the empirical 



process of residuals and showed that, unUke the residuals of autoregressive moving av- 
erage (ARMA) models, these residuals do not behave in this context like asymptotically 
independent random variables, and the asymptotic distribution involves a term depending 
on the parameters of the model. 

The classical two-sample problem is one of the central themes of nonparametric test- 
ing theory. One of the problems most frequently encountered in statistics is to test the 
hypothesis of no difference between two independent populations primarily on the basis 
of samples drawn at random from these two populations. Some of the earliest and most 
classical tests of nonparametric nature for this problem are Wilcoxon's test, the Mann 
and Whitney test, the Mood and Brown test, Lehmann's test, the Cramer-von Miscs 
test and Van der Wacrdcn's test. Moreover, the classical limit theorem of normalized 
two-sample linear test statistics which generated much interest in this context is the cel- 
ebrated Chernoff— Savage (1958) theorem. It is well known that the theorem is widely 
used to study the asymptotic power and power efficiency of the above two-sample tests. 
Further refinements on their conditions of this theorem, extensions and related results, 
are due to Durbin (1973), Puri and Sen (1993) and references therein. 

The natural extension of the two-sample problem is the fc-sample problem, where 
observations are taken under a variety of different and independent conditions. The non- 
parametric test procedures which have been developed for this /c-sample problem require 
no assumptions beyond continuous populations and therefore are applicable under any cir- 
cumstances. The classical tests in this context are the Kruskal-Wallis H test, Terpestra's 
fc-sample test, the Mood and Brown fc-sample test, Kiefer's /c-sample analogues of the 
Kolomogorov-Smirnov test and the Cramer-von Mises /c-sample test. To this end, it is of 
interest to state that Puri (1964) generalized the situation covered by the Chernoff— Savage 
(1958) theorem to the /c-sample problem. 

If GARCH errors were observable, the problem that we consider here would be the 
classical /c-sample problem studied by Puri (1964). In our context, we do not observe 
these errors, but assume that well-behaved estimators of the parameters of the model are 
available. Hence, our test procedure can be thought of as an extension of the A;-sample 
problem. More specifically, we are concerned with testing the null hypothesis that errors 
from k independent parametrically specified GARCH models have the same distribution 
versus a general alternative in the spirit of Chernoff and Savage (1958), Puri (1964), and 
Berkes and Horvath (2003). In contrast with the independent, identically distributed 
or ARMA setting, this study highfights some interesting features of k GARCH residual- 
based test statistics. 

Potential applications of the fc-sample test are to be found especially in studies of 
the behavior of speculative prices, such as stock prices or exchange rates, usually in view 
of testing market efficiency. One important problem, for example, the stock return of a 
company is defined as the error from a GARCH model, and the researcher is often in- 
terested in comparing the distributions of stock return of companies from k independent 
groups. Another related problem in this context is that the researcher may be interested 
in comparing the distribution functions of standardized real variables like exports or out- 
put growth rates with data from k independent companies. In other areas of financial 
markets it is often of interest to test whether k observable variables belong to the same 
location-scale family, which is also a special case of the test that we study. In all these 
situations, the usual approach to test for the equality of the distribution functions is to 
test the equality of just some moments to propose parametric models for the errors and 
then test whether the parameters estimated are equal. Instead, we propose to compare 
the entire distribution functions without assuming any parametric form for them. 



The objective of this paper is to study the asymptotic behavior of k GARCH residual- 
based hnear test statistics. The rest of the paper is organized as follows. Section 2 
introduces the construction of k GARCH residual-based empirical distribution functions 
and proposes linear test statistics pertaining to these residual-based empirical distribution 
functions. In Section 3, we establish the asymptotic validity of the test. Section 4 reports 
the results in terms of empirical size and power of a Monte Carlo study for validating the 
three-sample test based on the Wilcoxon and Van der Waerden score generating functions 
for finite sample sizes. As an example, the two tests are applied to daily individual stock 
returns of the New York Stock Exchange data. The proof of the result in Section 3 is 
provided in Section 5. 



2 k GARCH residual-based linear test statistics 

In this section, we propose a family of linear test statistics pertaining to empirical pro- 
cesses of residuals in order to test the null hypothesis that errors from k parametrically 
specified GARCH models have the same distribution against a general alternative. We 
shall formulate the /c-sample problem as follows. Let us consider the k independent ran- 
dom samples generated from the GARCH (pj, qj) models given by 

Pj Ij , (-]) 

^It - ^oj + j:»ijXi_,+ z(^iAt-i'^ i<t<nj, i<j<k, ^' 

i=l i'=l 



where the Sj^t are independent and identically distributed random variables such that 



-^(^L) ~ "^oj > 0, ctL > 0, 1 < i < Pj, /3q- > 0, 1 < i' < g^, and the e^^t is independent 



of s, s < t. Henceforth, it is tacitly assumed that a^j > when pj > 1, and /S^j > 
when qj > 1. 

In this paper, we are primarily concerned with the A;-sample problem of testing 

Ho:Fi{x)^---^ Fk{x) for all x 
against (2) 

Ha '■ Fi{x) 7^ Fj{x) for at least some x, and i ^ j, 

where Fj{-) is the distribution function of {sj^t}, which is assumed to be absolutely con- 
tinuous with respect to the Lebesgue measure, but unspecified. Henceforth, we assume 
that fj{x) = F-{x) exists and is defined over (— oo, oo). 

We first proceed to describe the quasi-maximum likelihood (QML) estimation of model 
(1). The vector of parameters is 9j = (6'ij, . . . , Oj^p.+g^^i)'^ = {uj, a], . . . ,a^/ , Pj, . . . , fSff 
which belongs to a compact parameter space 0^ C (0, oo)-' x [0, cxo)Pj+*. The true vector 
of parameters is unknown and is denoted by O^j = {ujQj, a^j, . . . , aQj, (3^^, . . . , ^qj)'^- 

Suppose that an observed stretch X,-_i, . . . , Xj^nj from {Xj^t} is available. Note that if 
{sj^t} is Gaussian, the quasi-likelihood function with respect to initial values Xj q, . . . , Xj^i^p., 
^|,o>--->^|,i-9,o is given by 

% 1 X"^ 
where the cr| j, t > 1 are defined recursively by 

Pj Qj 



As an example, one can choose the initial values as X|q = ■ ■ ■ = = cr|o = • • • = 

^li-qj = '^j or = ■ ■ ■ = Xl^-p^ = alo = --- = a|i_g. = 
We can now define the QML estimators of 6j by 

= arg maxL„.(^j) = arg min 

where 

For ^j,nj , it is assumed that 

\An,-OoA\ = Op{nj'/'), l<j<k, (3) 

where || ■ || denotes the Euclidean norm. The validity of (3) is established by Francq 
and Zakoi'an (2004) based on the conditions of Assumption 2 given below. Conditions 
(3) are also typically satisfied by the QML estimators of Straumann and Mikosch (2006). 
Henceforth, the empirical residuals are given by 

= Xj^t/^t{Oj,n^), l<j<k. 

For (2) , we first collect some basic tools and then describe our approach in the spirit 

of Chernoff and Savage (1958), and Puri (1964). Write N = Yl^=i''^j ^'^d Xjn = nj/N, 
1 ^ J ^ k- In the following, we assume that the inequalities < Aq < Aijv, . . . , A^jv < 
1 — Ao < 1 for some Aq < 1/k. Define by 

k 

the combined cumulative distribution function. Write Fj^njix) = nj^ Y^^=i[H^j,t ^ x)] 
and Fj^nj{x) = nj^ Ylt=i[H^j,t ^ ^)]) where is the indicator function of the event fl. 
Then the empirical distribution function is 

k 

and analogously, 

k 

'nN{x) = Y,\NFj,n,{x). (4) 

Set Bj^njix) = ny^{Fj^nj{x) - Fj{x)). Then by virtue of Berkes and Horvath (2003), it 
follows that 

Bj,nj (x) = <Sj- „ . {x) + AjXfj {x) + „ . {x) , (5) 

where sup^ \Cj,nj{x) \ = Op(l), 

rij 

£j,n,{x)^n-'/'J2^I{ej,t<x)-Fj{x)], A, ^ n/"{0'j,n, - 01j)tj,i 

t=i 1=1 



with 

T,^, = E[l/2~al{e,,)], Tj,i = E[X%_j2~a^,{e,,)l 2 < / < p, + 1, 

and Tj^p^.+i+i/ = E[al_ii{0Qj)/2al{0oj)], 1 < /' < g^, I < j < k. Hence, by analogy with 
(5), the asymptotic representation of (4) becomes 

k 

Hn^x) = Hn{x) + ^n7'/'A,wAx/j(a;) + Op{N-^'^). (6) 

Decomposition (6) is basic and plays an important role in the sequel. 

Define 5"^^^ = 1, if the ith smallest oi N — Y^^j=i empirical residuals is from {ij^t}-, 

and otherwise define S'fjl = I < i < N , I < j < k. Then, for (2), we shall consider a 
family of linear test statistics of the form 

N 



•' 1=1 



where the E'jjv are given constants called weights or scores. The definition of Tj^ is the 
one traditionally used. We shall, however, use the representation given by 

/./V 
J{J^nN{x)yF,,r^,{x), 1 < J < k, (7) 

where J{u), < rt < 1, is a continuous score-generating function. Note that Ei^ — 
J(i/{N + 1)), 1 < i < are functions of the ranks i (= 1, . . . , A^) and are explicity 
known. Some typical examples of J given in Puri and Sen (1993) are as follows: 

(i) Wilcoxon's /c-sample test with J(u) — u, < u < 1, 

(ii) Van der Waerden's /c-sample test with J{u) = < m < 1, where $(a;) ~ 

(iii) Mood's /c-sample test with J{u) = {u — |)^, < ti < 1, 

(iv) Klotz's normal /c-sample test with J{u) — ($~-'^(it))^, < u < 1. 

In the following, K will denote a generic constant taking many different values K > 
which may depend on J but will not depend on Fj{-), rij and N for all 1 < j < A;. 

3 Asymptotic properties of Tjn 

In this section, our primary object is to show that (7) has an asymptotically normal 
distribution. For this purpose, let {Aoj,t, I < j < k} he the {pj + qj) x {pj + qj) matrices 
defined by 



o-i ••• c/'' fy^' ■■■ 3'^' 

V Q(qj-l)xpj Iqj-l Ofe-l)xl / 



Assuming that 

E(log+||Ao,-i||) <^||Ao,,i|| < oo. 



the top Lyapunov exponent is defined by ^(Aoj) = inf(>i t^^E{\og || Aoj,iAoj,2 ■ ■ ■ Aoj,t||), 
where Aq^ = {Aoj,t, 1 ^ j < In particular, one can readily check that if {cj^t} is 
Gaussian, (8) holds. Bougerol and Picard (1992a,b) showed that if (8) holds, a general 
GARCH(pj, gj) process has a unique non-anticipative strictly stationary solution if and 
only if7(Ao,) <0, 1 < j < k. 

To establish the asymptotic properties of (7), we impose the following regularity con- 
ditions. 

Assumption 1 

(A.l) J{u) is not constant and has a continuous derivative J'{u) on (0,1). 

(A.2) \J{u)\ < K[u{l - 'u)]-5+<5 and \J'{u)\ < K[u{l - u)]-i+^ for some 5 > 0. 

(A. 3) xfj{x) and xf!j{x) are uniformly bounded continuous, and integrable functions on 
(— oo, oo). 

(A. 4) There exist constants Cj > such that Fj{x) > Cj{xfj{x)} for all x > 0. 

A few remarks concerning the necessity of these conditions are in order. Assumptions 
(A.l) and (A.2) arc basic conditions in our context. As noted by Chernoff and Savage 
(1958), typically (A.2) has two important functions: (i) it limits the growth of the function 
J and (ii) it supplies certain smoothness properties. Both conditions can be easily verifi- 
able in the preceding examples given by J. Assumption (A. 3) is basic and necessary for 
studying residual empirical processes and establishing the convergence result of (7). This 
condition was also made for empirical processes pertaining to linear regression residuals 
by Bai (1996). Assumption (A. 4) is virtually imposed in dealing with the convergence of 
higher order terms of (7). Finally, it is worth noting that conditions (A.l)— (A. 4) are typ- 
ically satisfied by several error distributions such as, normal. Student's t, logistic, double 
exponential, gamma and Laplace. 

To validate (3), we require the following additional regularity conditions, which can 
be found in Prancq and Zakoian (2004). 

Assumption 2 

(B.l) Oqj e 6j, where Qj denotes the interior of the compact parameter space Qj. 

(B.2) 7(Aoj) < and EtUPoj < 1 for all 9j e 9,-. 

(B.3) e'j f has a non-degenerate distribution with -E'(£^ J = 1. 

(B.4) = E{sl^ < oo. 

(B.5) If qj > 0, Ag^^.{z) and Bq^^^z) have no common root, Agg^, (1) ^ 0, and + /^q^ ^ 0, 
where Ae^.{z) = YlUaiz' and Bq^.^z) = l-Y.lLiPj z'' . Conventionally, A0^.{z) = 
if pj = and Bq^.{z) = 1 if qj = 0. 

We now justify that conditions (B.l) — (B.5) are necessary for the model under con- 
sideration. These conditions were essentially made by Francq and Zakoian (2004) for the 
validity of (3). We first note that the compactness of Qj is always assumed. 

Assumption (B.l) is typically necessary to obtain the asymptotic normality of the 
QML estimators Oj^nj, ^ ^ j ^ k. In the case of a^j = ctoj = 0, the limit distribution 



of ^JfCjipLj — aoj) is non-normal over [0, cxd). Assumption (B.2) is a sufficient condition 
for the stationarity and ergodicity of model (1). This condition implies that the roots of 
Bq.{z) are outside the unit disc. Moreover, if 7(Aoj) < 0, there exists s > such that 
Eiaf^ < oo and £'(X|^) < oo. Assumption (B.3) is made for model identification is not 
restrictive provided < oo. This moment condition is clearly necessary to estab- 

lish the asymptotic normality of the Gaussian QML estimator as in Berkes and Horvath 
(2003). The existence of a fourth-order moment given by (B.4) is a strengthening of (B.3) 
required for the finiteness of the variance of the score vector dlt{9Qj) / d9j. Note also that 
this condition does not imply the existence of a second-order moment for the observed 
process {Xj^}. It is often the case that the existence of the second-order moments is 
found to be inappropriate for financial applications. 

Finally, the assumption that the polynomials whose common roots uniquely identify 
9j was also made by Berkes et. al (2003). This condition is typically satisfied when pj > 1 
and Qj > 1. If pj = 1 and aoj ^ 0, the unique root of Agg.{z) — and Bg^.lz) ^ 0. If 
qj = 1 and f3oj = jS^j ^ 0, the unique root of Bg^^.{z) = l//9oj > 0, and because a^j > 
produces Ag^j^l/Poj) ^ 0. Moreover, it can be noted that (B.5) implies that Q^j does not 
necessarily have to belong to the interior of 0j. This is essentially important when dealing 
with situations of over-specification. When a GARCH(pj, q^) is fitted, one can show that 
an ARCH(pj) model can be estimated consistently. In a general sense, either pj or q^ can 
be over-specified, but not both of them. Indeed, it is required that «q^- > for some i 
when Pj > 0. If this assumption is dropped, the model solution would simply reduce to 
an i.i.d. white noise of the form (t|^ = (t|(1 — (5oj) + PojO'j^t-iJ where cr| = ujoj/ (1 — (5oj). 

In order to state the main result, we shall introduce the following notation: 

"^^0^) - ^ ^4/.., JIaF- ' M9j) - o,., , l<J<k. 



at{9oj) 89, d9j i a^{9,) 89 



By virtue of (B.4), it is seen that the ith element of each 9j^nj, ^ < j < k admits the 
asymptotic representation, 

rij 

^ t=i 

where Zl{9j) is the ith element of [V{{9(}j)]^^Ut{9j). 1 < i < pj + qj + l. As shown by Francq 
and Zakoian (2004), Uifioj) is positive definite for all 1 < j < /c. These considerations 
motivate the following result, whose proof is relegated to Section 5. 

Theorem 1. Suppose that Assumptions 1 and 2 hold and that, in addition, {^j,n } is 
a sequence of QML estimators typically satisfying (3). Then, as N ^ oo, 

where 1^ is the k x k identity matrix, Sjv the k x k positive definite dispersion ma- 
trix whose entries are given by (15) and (16), and sn = {Tjn — IJ'jN)i<j<k with /ijN = 
I J{HN)dF, 



X) 



Remark 1. If J(-) and S^v were known, an immediate consequence of Theorem 1 is 
that the quadratic statistic = A^s^S^^s^v has an approximate x^(^) distribution with 
k degrees of freedom under Hq (cf. Theorem 2.8 in Seber (1977)). Unfortunately, the 
covariance structure of Ejv, in general, depends on the unspecified distribution function 



Fj{-), the unknown parameter vector Ooj and some expectations. Thus, it is not possible 
to perform a consistent test based on £j^. Replacing S^r by a consistent estimator E^v 
(for details, see Section 4), we can effectively estimate jCn by — NsJjII^^Sn. Writing 
sn — N^^'^{TjN — l^jN)i<j<k-i we have Cn/Cn — sJf^N^SN/sJfT^'^^SN, and using Lemma 1 
given in Section 5, it follows that chfe(EjvSjv'^) < {^n/^n) < chii^N^N^), where chj(A) is 
the jth characteristic root of A. Moreover, by the ergodic theorem wc have SjvS^^ ^ I^, 
which implies chi(SArS];^^) 1 and chkCEj^J]]^^) ^ 1. Observing that C^/Cn 1, and 

writing Cn = jCn x {Cn/Cn) we may conclude from Slutsky's theorem that Cn X^(^) 
under Hq, as was to be proved. 

4 Simulation and empirical studies 

In this section we study the finite sample performance of the proposed test procedure 
by means of a simple numerical experiment and an empirical example. The ideal way 
to carry out the former case would be first to generate data from some specific GARCH 
model, and then estimate a GARCH model either correctly specified or not and check the 
asymptotic behavior of in terms of empirical size and power. 

For simplicity and clarity, we shall consider three-independent random samples gen- 
erated from the GARCH (1,1) model 

Xj,t = <Jt{Oj)ej,t: a^{9j)^u;j + a,Xl_,+Pjal,{9j), l<t<nj, 1 < j < 3, (9) 

where the ej^t are independent and identically distributed random variables such that 
E{e^j f) = 1, 9j — {ujj, aj,l3j)'^, cuj > 0, aj > 0, l3j > are unknown parameters, and the 
ej,t arc independent of Xj^g, s < t. Note that model (9) is the most commonly used in the 
literature, and enjoy substantial application in the finance setting. 

In the following, we are concerned with the three-sample problem of testing 

Ho : Fi(-) = F^i-) = Fs{-) against Ha : F,{-) ^ F^i-) + Fs{-), 

where Fj{-) is an absolutely continuous distribution function of {£j,t}, but unspecified. 
For testing Hq, we propose to use the statistic = A^s^S^^Sat, which has an approxi- 
mate Xr(3) distribution with 3 degrees of freedom and < r < 1 is the preassigned level 
of significance. 

We now describe our goodness-of-fit test using a smoothed bootstrap procedure. To 
this end, note that the asymptotic distribution of Tjj^ depends crucially on the assumption 
of continuity and hence bootstrap samples must be generated from continuous distribu- 
tions. The following steps provide an explicit description of the bootstrap test procedure 
based on C^: 

1. Having observed Xj^i, . . . , Xj^^p obtain an estimate Oj^n^ — {cuj, aj, (3j)^ of 9j using the 
QML method described in Section 2. 

2. Generate B independent sequences of i.i.d. standard normal random variables with 
replacement, each of length rij + uq, where no is the length of warm-up sequence 
to reduce the effect of initial conditions. Then define each of the B sequences by 

3. Generate B bootstrap GARCH(1,1) independent realizations X*i,...,X*,^. with re- 
placement, where the X*^, by analogy with (9), satisfy 



Note that {X*^} is a smooth bootstrap version of the sample {Xj^t}- 

4. For each of the B samples X*^, . . . ,X*^j^., obtain an estimate 9*^^. = (tD*, a*, P*Y of 
Oj^rij and construct bootstrap empirical residuals 

^It - ^It/^^j + + P*a;l,{e*^^), t^2,...,nj, l<j<3. 

5. For the score generating functions J{u) = u (Wilcoxon) and J{u) = (Van der 
Wacrdcn), evaluate the following integral by a rectangular numerical integration with 
m terms: 

T;n = J j{j^n*Ax))dFlnM), 1<J<3, 

where F*n.{-) denotes the empirical distribution function constructed from {s*^^ and 
is the bootstrap version of (6). Then, for each of the B residuals {Sj^j}, calculate 
= Ns*^Ti*^^s*j^, where = (T7iv~A*jJv)i<j<3 and is a resampled version of Ejv- 

6. Finally, repeat step 5 B times and then reject Hq with significance level r if the p- value 
r = P{C*N > ^*n) < where C*^ is the 1 — r sample quantile from {C*j^^j^^=i- Here B 
is chosen to be a sufficiently large integer. 

In what follows we test the null hypothesis that the zero-mean unit-variance errors 
have the same distribution function at the 5% significance level. For this purpose, we 
shall consider two data generating processes (DGPs): 

Xj^t = '^t{Oj)£j,t, 

DGPl: a^{9,) = 0.1 + 0.1 + 0.1 a^il^,), 

DGP2: a^iOj) = 0.5 + 0.4 X^,., + 0.4^2.^ (^,), 1 < j < 3, 

where the ei^t are i.i.d. random variables with an A/'(0, 1) distribution, the £2,* are i.i.d. 
random variables with mixture distribution (1 — Lp)N'{0, 1) + v^A/'(2, 1), < (/? < 1 and 
the £3,4 are i.i.d. random variables with Student's t distribution having (p~^ degrees of 
freedom. The values of (p that we consider are (p G {0, 1/9, 1/5, 1/3}. Note that if </? = 0, 
the errors £2,* and e^^t are generated from a standard normal distribution. The choice 
of (/? values, in principle, indicates that the last two error processes have a leptokurtic 
distribution whose tails are heavier than the ones of a normal distribution. Observe that 
Hq holds true if and only if = 0. We also notice that the parameter 99 represents the 
departure from A/"(0, 1) in the sense that the larger the value of </?, the larger the deviation 
from the null model. Here, the distributions of interest are re-scaled such that they have 
the required zero mean and unit variance. 

We generate repeated trials of lengths rii = n2 = & {100, 300, 500} from DGPl and 
DGP2, and compute the empirical size and power of the 3— sample bootstrap Wilcoxon 
(W) and Van der Waerden (VdW) tests at the 5% nominal level based on the steps 1—6 
for each trial. The number of Monte Carlo trials is 10000 with B — 1000 bootstrap reph- 
cations each. Each configuration of parameters was estimated by the QML method. 

Table 1 reports the empirical proportion of rejections of Hq for the W and VdW tests 
based on the corresponding asymptotic Xomi'^) distribution. For the sake of brevity, we 
do not include the results for Mood's and Klotz's normal tests, which are quite similar. 



From Table 1, it can be seen that the values are stable with respect to the choice of sample 
sizes and parameters. We noted in our Theorem 1 that the empirical rate of convergence 
of the normalized random variable H^^'^sn to the /c-variate normal distribution A/'(0, Ik) 
depends on the parameters of the GARCH process. The smaller the parameters aj and 
(3j, the faster the convergence. This is intuitively clear because larger values of aj and 
f3j imply not only more dependence, but also heavier tails of the error distributions (cf. 
Basrak et al. (2002)). More specifically, we observe that the power of the tests for the 
DGP 1 is generally higher than that for the DGP 2 with respect to the sample sizes. 

Overall, the two bootstrap-based statistics perform reasonably well in terms of em- 
pirical size and power, and none of them provides an obvious answer to the question of 
what test statistic should be preferred. Therefore, in practice we cannot know in advance 
which of them would lead to a more powerful test. Moreover, as the sample sizes and 
(p increase, the size of both the tests converge to the theoretical level and their powers 
generally increase. When the error distributions are sufficiently different, the power of 
the tests is adequate for three different choices of the sample size. It is worth noting that 
the highest power of such tests is attained at = 1/3. 

Table 1: Proportion of rejections of Hq for the bootstrap W and VdW tests at r = 5% 



DGP 1 





rii = n2 


= 723 = 100 


ni = n2 = 


713 = 300 


ni = n2 


= 713 = 500 


W 


VdW 


W 


VdW 


W 


VdW 





0.045 


0.046 


0.052 


0.053 


0.049 


0.051 


1/9 


0.122 


0.123 


0.171 


0.169 


0.214 


0.221 


1/5 


0.321 


0.331 


0.412 


0.401 


0.785 


0.788 


1/3 


0.821 


0.818 


0.861 


0.871 


0.913 


0.912 


DGP 2 





0.041 


0.042 


0.056 


0.055 


0.045 


0.046 


1/9 


0.102 


0.104 


0.151 


0.148 


0.193 


0.195 


1/5 


0.313 


0.314 


0.393 


0.401 


0.712 


0.717 


1/3 


0.801 


0.796 


0.815 


0.817 


0.897 


0.894 



We conclude this section with a simple empirical example based on daily data. For 
this purpose, we apply the bootstrap W and VdW tests to the series of residuals obtained 
from the estimation of a GARCH(1,1) on series of daily individual stock returns for the 
three companies (i) AMOGO, (ii) FORD and (iii) HP listed on New York Stock Exchange. 
Each series starts from July 3, 1962, to December 31, 1991 with 7420 observations. In our 
analysis, however, we consider the last 2000 data points from each series from February 
2, 1984, to December 31, 1991. 

Table 2 displays the empirical proportion of rejections of Hq for the W and VdW tests 
at the 5% significance level. The result shows that the tests have similar desirable size 
and power at the 5% level. To this end, the results provide enough evidence in support 
of the simulation results. For all the three considered series, the hypothesis of normality 
of the error distributions is rejected at the 5% level. The bootstrap tests we studied in 
this paper have reasonable size and can detect a misspecified probability distribution of 
the errors in a GARGH model with high probability. 



Table 2: Proportion of rejections of Hp for the bootstrap W and VdW tests at r = 5% 








1/9 


1/5 


1/3 


w 


0.050 


0.616 


0.981 


1.000 




0.019 


0.618 


0.978 


1.000 



5 Proof and Auxiliary Lemma 

In this section we provide Lemma 1 and the proof of Theorem 1. Lemma 1 is useful for 
ordering characteristic roots of a product of two matrices (see e.g., Sen and Singer (1993)). 

Lemma 1 (Courant). Let U and V be positive semi-definite matrices. Suppose that 
V is nonsingular and that x = {xi, . . . , Xk)'^ G {—oo, oo)^ is a characteristic vector. Then 
if the product UV^^ is well defined, and if Vi denotes the ith characteristic root of UV~^ 
for i = 1, . . . , k, we have 



chkiUV-^) ^Vk^ inf 



x'^Ux 



X x^Vx 

Next we provide the proof of Theorem 1. 



< sup '■ 



x^Ux 
x'^Vx 



Proof of Theorem 1. Write dFj^^. = d^Fj^n^ - Fj + Fj) and 

Hn 



+ 



J 



N 



-n 



N 



Then the decomposition of (7) is given by 



-J{H 



N 



N+1 

-Hn — Hn ) J'{Hn) 



where 



J-jN 

BlN,j 
B2N,j 
CiN,j 
C2N,j 



HjN + Bin J + B2N,j + Cinj + C2N,j + C^nj, 



J{HN)d{F,,^^-F,){x), 
j{UN-HN)J'{HN)dF^{x), 

-Jf^ I nNJ'{HN)dFj,n,{x), 

j {Hn - HN)J'{HN)d{Fj,n, - Fj){x), 



N 



To prove this theorem, it is necessary to show that (i) the vector N^^'^(Binj + 
B2N,j)i<j<k when properly normalized has a limiting Gaussian distribution, and (ii) 
the C* terms are uniformly of higher order. For (i), we observe that the difference 



j)i<j<k tends to zero in probability and so 
the vectors N^^'^iTjM — IJ'jN)i<j<k and N^^'^{Bi]sij + B2N,j)i<j<k possess the same hmiting 
distribution. 

Let us now proceed to show the statement (i). Prom (5), it is easily seen that 

BiN,j = J J{H^)d{Fj^n, - Fj){x) + nj'/'Aj j J{H^)d{xfj{x)) + 0^(1). (10) 
Integrating B2N,j by parts, and using (6) and (10), it follows that 

XiN / Bj{x)d{Fi^rii- Fi){x) 

i=l 

+ j {J{H^) - XjBj{x))d{Fj,r., - Fj){x) 
-rij^^^AjJ^^iN j xfjix)J'{HN)dFi{x) 

J2\Nn;'^'Ai I xU{x)J'{H^)dFj{x)) + Op{l) 



k 



1=1 



= ajN + hjN + CjN + djN + Op{l) (say), (11) 

where Bj{x) — p J'{HN)dFj{y) with xq determined somewhat arbitrarily, say by Hm{xo) - 
1/2. 

In what follows, we shall first evaluate the asymptotic variance of (11) and then the 
asymptotic covariance to construct the dispersion matrix Ejv- For this purpose, first 
consider qjn and write it as 

-7VV2A.^ J B^{x)d{F,,n,-Fi){x) 
= N^/^\n J{Fi^^^-Fi)J'{HN)dFj{x), i^l,...,j-l,j + l,...,k. 
Then the mean is zero and the variance is 

Fi)J'{HM)dFj{x)^ 

= eI^NX^j, JJ (F,,„,(x) - F,(x))(F«(t/) - F,{y))f{Hr,{x))f{H^{y))dFj{x)dFj{y)y 
= 2X,N jj F,{x){l - Fi{y))J'{HN{x))J'{HN{y))dF,{x)dF,{y). 



x<y 



Note that the application of Fubini's theorem permits the interchange of integral and 
expectation. 

By a similar argument, the variance of 

bjN = -A^'/' I (^^'-^ - F,)J\HN)dF,{x) 

i=i 



is given by 

k 



^T.^'nII FAx){l - F,{y))J\H^{x))J\H^{y))dF,{x)dF,{y) 



k 



2 

+ 



>^iN\'N II F,{x){l - Fj{y))J\Hi,{x))J\HM{y))dFi{x)dFi,{y) 

■•,1' -L I'^ll 



+ 



2 



XjN 



J2 ^^^^^'^ JJ ^MC^ - F^{x))J'{HM{x))J'{HM{y))dFi{x)dFi,{y). 



Therefore, by observing that ajN and hjN are mutually independent variables, it follows 
by the result of Puri (1964) that 



= 2|J]A,jv jj T,M{x,y)dF,{x)dFj{y) 



k 

4 



+ A^E^^^ jj ^jN(x,y)dFi(x)dFi(y) 



k 



+^ ^ XiNK'N<^jj TjN{x,y)dFi{x)dFi,{y) 

^ (12) 



y<x 



where Tjj^{u,v) = Fj{u){l — Fj{y))J\H]s[{u))J\H]si{y)). To evaluate the same for Cjjq 
and djN, recall the result of Prancq and Zakoi'an (2004) that 

Var{n]'\d^,n, - ^o,)) = («,■ - l)[W(^o,)]-\ 1 < J < /c. 
In view of (5) and (11), it follows that 

o-27v,ii = Var{cjN) = (fij - l)u}jN[U{9oj)]~^uJjN, (13) 

k 

where ujjn = -A";^ Z) \n J xfj{x)J'{HN)dFi{x) x r,- with Tj = (t,-,i, . . . ,Tj^p.+g.+i)'^ , 

i=l 

and analogously 

k 

cr3N,jj = 1^ar(c?jjv) = ^ii^i - ^)i^m[^i^oi)]~^i^iN: (14) 



where i/jjv = X]l^ J xfi{x)J'{Hj^)dFj{x)XTi. Moreover, by independence of Xj^i^ . . . , Xj^rij, 
it remains to evaluate 

KiN,j = 2E{ajNdjN) and Xsivj = 2E{bjNCjN)- 



Using (11), we obtain 

k 

i=i •'•^ 

where ipiN{u,v) — vfi{v)J'{HN{u))J'{HN{v)). To obtain an explicit expression of Kipfj, 
it is necessary to evaluate E[-]. Prom the result of Berkes and Horvath (2003) and (5), 
we find that 

Pi+Qi + i 

E[ny'{F,,^,{x)-F,{x))A.]^ J2 n,ih\{^), 

1=1 

where h\{v) = 5l Jl^{u^ - l)fi{u)du with 5\ = E{Zl{eoi)), l<i<k. Then 

k Pi+qi+1 „„ 

KiNj^2^ ^ XiNTi^i hl{x)ipiN{x,y)dFj{x)dFj{y), 

i=l 1=1 •'•^ 

and similarly 

i^2JVj = T— V V Xf^Tj^i h^j{x)il:jN{x,y)dFi{x)dFi{y). 

^jN^l ^ J J 

Therefore, the variance terms when combined yield 

^N,jj = O-iNJj + 0-2N,jj + 0-3N,jj + lN,jj, (15) 

where j^jj = ^ijvj + ^2JV,i- 

We next turn to evaluate the covariance terms. For this purpose, rewrite (11) as 

= N'/^J2^^^{- UF^,n,{x) - F^{x))J\Hj,)dFi{x) 
i=i 

+ j{F^}{x)-F,{x))J\H^)dF^{x)-nf^A, J xf^{x)J\H^)dF,{x) 

j xU{x)J\HN)dF,{x)) + 0^(1) 
= aiN,j + hiN,j + ciN,j + diN,j + Op{l), (say). 
By independence of Xj^i, . . . , Xj^n., 1 < j < k, we first compute 

ClAfji' = Cov{aiiyj + fclATj, aiN,j' + b2N,j') 

= E{aiN,jbiN,j') + E{biN,jaiN,j') + E{biN,jbiN,j'), j f = I, . . . ,k. 

From 

oiivjfeiivj' = 5^ '^iJvAzAT // (i^j,n, (a:^) - Fj{x)){Fi^ni{y) - Fi{y)) 
i=i 1=1 '''' 
xJ'{HN{x))J'{HN{y))dF,{x)dF,,{y), 



it follows by using again the result of Puri (1964) that 

k 

E(a,N,M,j,) = -E^^^ jj Fj{x){l-Fj{y))J'{H^{x))J'{Hj,{y))dF,{x)dFj,{y) 

x<y 

k 

II F,{y){l-F,{x))J\Hn{x))J\Hj,{y))dF,{x)dF,,{y). 

i=l ''J 



y<x 

In the same way, we have 

k 



E{hN,ja^N,j') = -J]A,jv [f Fj,{x){l- Fj,{y))f{HN{x))f{Hj,{y))dFi{x)dFj{y) 

' x<y 
k 

-^A,^ // FAy){l-FAx))J'(Hr,(x))J'(HM)dFi{x)dFj(y) 



y<x 

and 



k 

E(hN,M,r) = X^A^AT JJ F,{x){l- F,{y))J'{H^{x))J\HN{y))dFj{x)dFj,{y) 

x<y 

k 

+ If Fi{y){l-Fi{x))J'{HN{x))J'{HN{y))dFj{x)dFj,{y). 



y<x 

Therefore, 



f^iNjj' = -y^.^iNjJJ TjN{x,y)dFi{x)dFy{y)+ JJ TjNiy,x)dFi{x)dFj>{y) 

x<y y<x 
k . . 

-J2^^^{JJ ^j'N{x,y)dFi{x)dFj{y) + JJ T j,N{y,x)dF,{x)dF^{y)) 

x<y y<x 

+ J2^^^{JJ ^rN{x,y)dFj{x)dFj,{y)+ JJ T,r,{y,x)dFj{x)dFj,{y)y 

i=l 



x<y y<x 

Now we turn to evaluate, for j ^ j', 

LiNjj' — E{aiNjdiNj') + E{diNjaiNj') and L2N,jj' — E{biNjCiN,j') + E{ciNjbiN,j')- 
In analogy with the preceding terms, we have 

k Pj+qj+i 



LiNjj' = -^^iN^ ^^'^ // h]{x)tpj{x,y)dFi{x)dFy{y) 

i=l 1=1 '''' 

+ Yl ^i'.' JJ h],{x)Mx,y)dFi{x)dF^{y)) 



and 



1=1 1=1 
+ E jj h],{y)il^j'{y,x)dF,{x)dF^{y)^ 



1=1 z=i 

E 

1=1 

Therefore, combining the covariance terms produces 

a jvjjv = a iN,jj' + cr 2N,jj' , (16) 

where a2N,jj' = Li^jy + L2N,jj', j f ^ I, . . . ,k. 

Hence, using (13) — (16) and the central hmit theorems for martingale differences given 
by Berkes and Horvath (2003), and Francq and Zakoian (2004), we may conclude that 

N'/^E-'/\B,^j + B2N,j)i<j<k A^(0, 4) as TV ^ oo. 

Next, we turn to show statement (ii). For this purpose, we require the following 
elementary results (see Puri (1964)). 

(i) HM>\,NF,>\oFj, l<j<k. 

(ii) (1 - F,) < (1 - Hn)/X,^ < (1 - Hn)/Xo, l<j<k. 

(iii) Fj{l - Fj) < H^{1 - Hi,)/X]^ < H^il - Hn)/XI l<j<k. 

(iv) dHM > XjNdFj > XodFj, 1 < j < k. 

(v) Let {i^in,^2n) be the interval S'at, such that 

Sn, = {x : Hn{1 - Hm) > VeXo/N}, (17) 

where e > is arbitrarily small and r]e{> 0) depends e. Thus, 

< N{H^{^,^)), (1 - H^{^2n)) < Veil + N'\). (18) 
Hence, rj^ can be chosen independently of Fj and XjN in such a way that 

N{Hj,{i»,n) + (1 - Hn{i»2n))) < e. (19) 

Prom (19), it follows that 

P{ej,teSN^,l<t<n,,l<j<k) 

N 

^1[[Hj{^2n)-Hj{An)] 

N 

= - [H,{^,^) + 1 - H,{^2n)]} 

N 

>1-J2[HAAn) + 1-H,{^2n)] 
= 1 - N[H,,{^^N) + (1 - H^{^2n))] 

> 1 - e. (20) 



Let us first evaluate Cmj- By (6) and dPj^nj = d{Fj^n. — Fj + Fj), we have 

4 

XI C'li^j + %(^~^) ' (say) ■ 



-1/2 



1=1 



The proof of Cunj — Op{N ^/^) follows precisely the same arguments as in Puri (1964). 
Next we turn to Ci2N,j- By (A. 2) and (A. 3), we obtain 

k 

1=1 

k 
i=l 



In a similar fashion as the proof for Cunj, it follows that 

1 J \J'{Hr,)\dFj,^^{x)^o,{N-'/'), 
which, combined with the fact 

i=i i=i 
imphes Ci2N,j — Op{N~^). Next consider 

Ci3Af,i = -nj ^^"^AjiCl^Nj + Ci3^j), 

where 

= ]vTii nNJ'{H^)d{xf,{x)), 

Cr^N, = JvTi/c '^NJ\HN)d{xf){x)) 

and yS"^^ is the complementary event of Sn^- Let us first deal with Cj'g^j. In view of (A. 2), 
(A.3), (17) and (18), it follows that 



\Cl^N,i\ < ^ I \J'{H^)\dFj{x) 



< ^ / [Hr,{l-H^)]-'^+'dH^{x) 

J AT 

< 22 



Now using the Markoff inequality, we obtain 



where m > and may depend on e. Next consider C'^sjvj- Write i^i = H^{j}x^^ and 
= i^iv(^2iv). Then from (17) and (18), we have Hi ^ 1 - H2 < K/N. By (20), we are 
certain that ej^t ^ and 

+ [ [HM{l-HM)]-'^+'dHr,{x)) 
JH2 ' 

CjN Jo 

K , , 

Therefore, by using (21), we have 

Cisivj = Op(iV-^/2)_ ^24) 

Similarly, it can be shown that Cunj = Op{N~^). Consequently, we have 

Ci^j = o,(Ar-V2). 

Next, we consider C2N,j- By analogy with the first C term, we have 
C2NJ = jiHN- HN)J'{HN)d{Fj,n, - Fj){x) 

+^E^J^''4^ / xMx)J'{H^)d{F,,^,-Fj){x) 

i=i 

n'^^^A- ^ , r 

j xU{x)J'{H^)d{xax)) 



i=l 



+n7'/'^- j {Hn - HN)J'{HN)d{xfj{x)) + Op{N-') 
4 

= ^C2iN,j + 0p{N-'^), (say). 
1=1 

The proof of Caijvj = Op{N~^/^) is identical to that of Puri (1964). Next, we consider 

1 ^ 1/2 

C22N,j — ^ rij Ai{C22N,j + C22N,j)j 



i=l 



for which, it suffices to show 

C22ivj = / xU{x)J'{H^)d{F^,r.,-F^){x)^o,{l), (25) 

C*22N,j = / xU{x)J'{H^)d{F^,r.,-F^){x)^o,{l). (26) 

Note that from (A. 2) and (A. 3), we can find A' > such that \xfj{x)\ < KH^i^ — H^)- 
Then from (17), (18) and (22), it foUows that (25) is dominated by 

\C22N,j\ < kI \xf^{x)\\J'{H^)\\d{F^,^^-F^){x)\ 
< k! [H^{l-H^)]-'^^'\d{F^,r.,-F^){x)\ 

' j'^ 0{N'^-')\d[n]'\F,,^^ - F,){x)]\ = 0,(1). 



1/2 



Likewise, it is easy to show from (23) that (26) is dominated by 

HM - H^)]-2+'\d(F,.„. - F,)(x) 



< 


K 


a 










+ 










< 


^7 


1/2 



r 0(Ari-^)|d[nf (F,,„^. - F,)(x)]| = 0,(1). 
Jo 

Therefore, it foUows from (21) that C22N,j = Op{N-'^/'^). The proof for C23N,j = Op{N-^/^) 
is anafogous to (24). To complete the assertion for C2N,j, it remains to evaluate C24iVj ~ 
MC24N,j + C;:N,j): where 

C;4N,j = ^7'^'/ {nN-HN)f{Hj,)d{xfj{x)), 

CllNj = ^7'^'/ {nN-HN)J'{Hj,)d{xfj{x)). 



M5') ^ sup Z , ' :f ^ ;r .1-,. < C\ 5'>0, C*>0 (27) 



By virtue of Puri and Sen (1993, Theorem 2.11.10), write 

X [Hn{x){1-Hn{x))Y 

so that P{1n{^')) > 1 - e. Then, if we let 5' < S, it follows from (A.2)-(A.4), (22) and 
(27) that 

C*24Nj = \HN-HN\\J\HN)\dF,{x) 



< [ 0{N-'/')[H^{l-HN)]'-''-'dHN{x) 

9 JSn, 

< ^^^0(Ar-V2) f Hf,-''-'dH^{x) 



^3 



and similarly from (23) that 



Kn 



-1/2 



-0{N 



-1/2N 



Hi 



Hf,-'-'dHj,{x)^o{N-'). 



Hence, C2AN,j = Op{N ^/^). Consequently, we have 

Finally, we evaluate C^nj- Following the preceding term, and using 



J{Hn) + 



N + 1 



-Hn — Hn 



xJ'i^gHj, + {l- g)j^nN), 0<g<l, 



we obtain 



N 



N + 1 



Hn — Hn 



i=i ^ 

r / N - \ 1 

X \j'(gHM + (1 - ^)^^7iivj-J'(i^iv)J(iF,>,(x) 

r / iV \ 1 

X J'[gH^ + (1 - ^)--_7i^j-J'(i/^)Jc^(a;/,.(x)) 

X \j'[gE^ + (1 - ^)-^7i^)-J'(i/^)]d(x/,.(x)) + Op{N-^) 

4 

= ^C3ijvj + 0p(A^~^), (say). 



i=l 



First consider Csiatj = C'siatj + C's^atj; where 



C. 



31N,j 



+ 1 



7i 



N 



X 



J'i^gHr, + (1 - ^)^^7i^j-J'(i/^)JciF,-„.(x), 



A^ 



5c VAT + l 



X 



AT 



J' ( gHM + (1 - ^?)]^^^ ) -J'iH, 



IN 



To evaluate C^u^j, first note from (6), (A. 2), (A. 3) and (21) tliat 



N 



N 



N 



= N-'/'0{l){l + [H^{l-H^)]'^-''}, 
wfiere 0{1) is uniform in x. Then from (18) and (28), it follows that 

1 - (^gH^ + {1 - g)^^n^^ 



N + 

= (l-Hj,) + {l-g)(H^ 



N 



-n 



N 



N + 1 

(1 - Hj,) + 0(N-'/'){l + [Hn{1 - Hj,)]'^-''} 

(1 - H^){1 + 0{N-'/^)Hl-\l - Hj,)-'^-''} + 0{N-"^) 

(1 - Hj,){l + 0{N-^/'')0{N'^+y))} + C»(Ar-V2) 

(l-//^)(l + o(l)) + 0(Ar-V2) 

(l-//^) + C»(Ar-V2) 



or equivalently 



(l - {qHn + (1 - ^)^^7i^)) x(l - //^)-i = 1 + 0(7V-V2). 
Thus, for sufficiently large N > 0, we can find C > such that 



inf ( gHM{x) + (1 - ^) 



N 



N+l 



iiN{x) 



'l-{gHN{x) + {l-g)j^nN{x)) 



Hr,ix){l - Hj,{x)) 



with probability > 1 — e. Now write 

N 



< 



N+1 



Hn — H]s[ 



X 



J' 



gHM + {1- q) 



N 



N+1 



-n 



N 



-J'{Hj,) dF,-„.(x) 



= j QNdFj^njix), (say). 
Then it is easy to show from (A. 2), (22), (27), (29) and (30) that 
Ef QNdFj,^.{x) < X(l + C'-i)e»(7V-V2) 

X / [HN{l-HN)Y-''-^dH^ix) 

< K{1 + C'-'^)0{N-^/^) j\'-''-'dHMix). 



Thus, Qn{x) is integrablc and converges to in probability. Hence, by virtue of the 
dominated convergence theorem and (31), it is seen that C^^j^j = Op{N~^^'^). Similarly, 
we can show C^i^j = Op{N~^/'^) by using the arguments of (23) and (31). Next consider 



1 ^ 1/2 

1=1 



where 



c: 



32N,j 



/ xMx)[j'[ 



N 

qHn + (1 - q)j^—^T^n 



qHn + g) 



N 



N+1 



n 



N 



-J'{H^)}dFj,^^{x) 
-J'{Hr,)}dFj,n,(x)- 



Let us first evaluate C^2N,j- Recalling \xfj{x)\ < KHj^{l — Hr^), and using the arguments 
of C^iNj and (A. 2), we obtain 



E{\C;2N,j\) < / \xMx)\J'gHr, + {l-g)-—HN 
Js^, L N+1 . 

< K{l + C'-I) [ [Hj,{l-HN)]'-"2dHj,{x) 

J SNf 
1 



-J'iH 



Nj 



dFj{x) 



(32) 



In analogy with (23) and (32), we can show C^2N j ~ Op(A^ ^^^). Hence, from (21), we 
have C^2N,j = Op(iV~^/^). Next, we evaluate Cssatj = C'gg^j + Cg^^j, where 



C. 



^33N,j 



1/2 



{j'[gHr, + {l - g)j;^nN]-J'{HM)}d{xfj{x)) 

-J'{HM)]d{xf^{x)) 
Following the arguments of C'gj^jYj, and using (A. 2) — (A. 4), we obtain 

N 



N 



[j'[gHN+{l-g)jf^nN 



\C: 



33Afjl - 



X 



1/2 



A^ + 1 



J' 



gHN + (1 - ^) 



N 



N + 1 



n 



N 



-J\Hn) dFj{x) 



Kn 
< 1 



-1/2 



1 + C'-'^)0{N-'/') j'^Hf,-''-'dHj,{x). 



-l/2^ 



Therefore, C^snj = Op{N ^"). Similarly, in view of (22), we can show C^^nj = Op{N' 
Hence, by (21), we have C^^nj = Op{N~^/'^). To complete the evaluation of Csatj, we 



consider C34N,j = (C34JVj- + QliVj); where 



a 



N + 



n. 



1/2 



i=l 



X 



xfi{x) 



<^34iV,j 



i=l 



n. 



1/2 



xfi{x) 



f r 



X 



AT 



^]d{xfj{x)). 



We first turn to evaluate Cl^j^ ^. From (A.2)-(A.4), (21) and (32), it follows that 



i=l •''S'jVe 



X 



J' 



qHn + (1 - ^>) 



N 



k „i 



H 



N 



-J'{Hn) dFj{x) 



i=l '' N 



(33) 



Thus, C34JVJ = Op{N ^/^). By analogy with (23) and (33), we can show Cgljvj = 
Op{N~^/'^). Consequently, we have 

This completes the proof of the theorem. 
Acknowledgement 

The author is grateful to the associate editor and two referees for their insightful remarks 
and suggestions that greatly improved the original version of this paper. 



References 

[1] Bai, J. (1996). Testing for parameter constancy in linear regressions: an empirical 
distribution approach. Econometrica 64, 597-622. 

[2] Basrak, B., Davis, R. and Mikosch, T. (2002). Regular variation of GARCH processes. 
Stochastic Process. Appl. 99, 95-115. 

[3] Bauwens, L., Laurent, S. and Rombouts, J. (2006). Multivriate GARCH Models: A 
survey. J. Appl. Econ. 21, 79-109. 

[4] Berkes, I. and Horvath, L. (2002). Empirical processes of residuals. In: Empirical 
Proc. and their Appl.., 195-205. 



[5] Berkes, I. and Horvath, L. (2003). Limit results for the empirical process of squared 
residuals in GARCH models. Stoch. Proc. Appl. 105, 271-298. 



[6] Berkes, I. and Horvath, L. (2003). The rate of consistency of the quasi-maximum 
hkehhood estimator. Statist. Prob. Lett. 61, 133-143. 

[7] Bollerslev, T. (1986). GeneraUzed autoregressive conditional heteroskedasticity. J. 
Econometrics 31, 307-328. 

[8] Bougerol, P. and Picard, N. (1992a). Strict stationarity of generahzed autoregressive 
processes. Ann. Prob. 20, 1714-1730. 

[9] Bougerol, P. and Picard, N. (1992b). Stationarity of GARCH processes and of some 
nonnegative time series. J. Econometrics 52, 115-127. 

[10] Brockwell, P. J. and Davis, R. A. (1994). Time Series: Theory and Practice, 2nd ed. 
New York: Springer- Verlag. 

[11] Chernoff, H. and Savage, I. R. (1958). Asymptotic normality and efficiency of certain 
nonparametric test statistics. Ann. Math. Statist. 29, 972-994. 

[12] Durbin, J. (1973). Distribution Theory for Tests based on the Sample Distribution 
Function. Regional Conference Series in Apphed Mathematics 9, SIAM, Philadelphia, 
Pennsylvania. 

[13] Engle, R. F. (1982). Autoregressive conditional heteroskedasticity with estimates of 
the variance of U.K. inflation. Econometrica 50, 987-1007. 

[14] Engle, R. (1995). ARCH Selected Readings. New York: Oxford University Press. 

[15] Francq, C. and Zakoian, J-M. (2004). Maximum likelihood estimation of pure 
GARCH and ARMA-GARCH processes. Bernoulli 10, No.4, 605-637. 

[16] Gourieroux, C. (1997). ARCH Models and Financial Applications. New York: 
Springer. 

[17] Mikosch, T. (2003). Modeling dependence and tails of financial time series. In Extreme 
Values in Finance, Telecomunications, and the Environment (B. Finkenstadt and H. 
Rootzen, eds.), 185-286, Chapman and Hall, Boca Raton, FL. 

[18] Puri, M. L. (1964). Asymptotic efficiency of a class of c-sample tests. Ann. Math. 
Statist. 35, 102-121. 

[19] Puri, M. L. and Sen, P. K. (1993). Nonparametrics Methods in Multivariate Analysis. 
Wiley: New York. 

[20] Seber, G. A. F. (1977). Linear Regression Analysis. Wiley: New York. 

[21] Sen, P. K. and Singer, J. M. (1993). Large Sample Methods in Statistics: An Intro- 
duction with Applications. Chapman & Hall: London. 

[22] Straumann, D. and Mikosch, T. (2006). Quasi-maximum likehhood estimation in con- 
ditionally hctcroskcdastic time series: A stochastic recurrence equations approach. 
Ann. Statist. 34, 2449-2495. 



